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The present invention relates to polypeptides encoded by the human polymorphic epithelial mucin gene and to 
the use of the polypeptides in the therapeutic treatment of cancer and in in vitro diagnosis. 

Normal and malignant human mammary epithelial cells express high molecular weight glycoproteins (gps) which 
are extensively glycosylated and very antigenic. As a result, many of the monoclonal antibodies (MAbs) selected for 
reactivity with human breast cancer and other carcinomas are found to react with molecules which are produced in 
abundance by the fully differentiated human mammary tissue and are found in the milk fat globule (MFG) and in milk. 
However, the level of expression of a particular antigenic determinant may be different in the gps produced by the 
normal differentiated cell and in the similar molecules produced by breast cancers. This means that some antibodies 
can show a certain specificity for reacting with tumour gps. 

The molecules bearing the epitopes recognised by these antibodies are complex and have been difficult to analyse, 
both because they are large and heavily glycosylated (>250,000 daltons) and because of the complex pattern of ex- 
pression. Two of the MAbs, HMFG-1 and -2, react with a component in human milk which appears to be greater than 
400,000 daltons, whereas the molecules found in sera and tumours are smaller, although the dominant components 
are still greater than 200,000 daltons on immunoblots. The large glycoprotein produced by the differentiated mammary 
epithelial cells found in human milk or in the milk fat globule has been purified and shown to have some of the char- 
acteristics of the mucins. This component contains a large amount of carbohydrate joined in O-linkage to serine and 
threonine residues via the linkage sugar N-acetylgalactosamine. Moreover, the core protein contains high levels of 
serine, threonine and proline and low levels of aromatic and sulphur containing amino acids. 

These mucin-like glycoproteins are also secreted by a number of other normal epithelial cells. The monoclonal 
antibody HMFG-1 is highly reactive with the milk mucin and evidence suggests that the epitope recognised by this 
antibody is more abundant on the fully processed mucin, characteristic of normal differentiation. 

In tumours, the molecular weight of the molecules carrying these antigenic determinants differs among individual 
tumours and, in the case of the components recognised by the HMFG-2 antibody, can range from 80-400K daltons. 
Although it appears that the differences observed in the mobility of the high molecular weight bands are due to genetic 
polymorphism this probably does not explain variations in the size of the lower bands. It has been proposed that these 
may be the result of aberrant processing occurring in the tumour cell possibly within the glycosylation pathways. 

For the majority of the monoclonal antibodies reacting with this group of molecules the exact nature of the antigenic 
epitopes remains unclear but circumstantial evidence has suggested that carbohydrate may at least be partly involved 
in many of the epitopes. Moreover, from previously available data it was not known whether the mucin found in the 
normal differentiated cells, and that observed in the tumours, contain the same core protein, or just carry common 
carbohydrate determinants. 

Mucin has now been isolated from human milk by affinity chromatography enabling identification of the core protein 
and the gene encoding the protein. This has been found to be a highly polymorphic gene defined by the peanut urinary 
mucin (PUM) locus [see Swallow et al., Disease Markers , 4, 247, (1986) and Nature, 327 , 82-84 (1987)]. The gene 
product, which is hereafter referred to as human polymorphic epithelial mucin or HPEM, has been detected in breast 
tumours and other carcinomas as well as in some normal epithelial tissues. 

Rittenhouse et al (Laboratory Medicine, 16(9), 556-560 (1985)) review the field of mucin-type glycoproteins as 
tumour markers and argue that the distinction between normal and tumour mucins is associated with different patterns 
of glycosylation. 

Ormerod et al (Br. J. Cancer (1983), 48^ 533-541) have partially purified epithelial membrane antigen (EMA), a 
soluble component of human milk, and have partially deglycosylated this by chemical means. 

Sekine et al (J. Immunol. 135, 3610-3615 (1985)) describes the murine monoclonal antibody DF3 and character- 
isation of its human breast carcinoma-derived antigen as a mucin-like glycoprotein having several sialyl oligonucle- 
otides on a peptide backbone, based on studies including enzyme and chemical digestions of the antigen. 

Gendler et al (J. Cell Biology US (1986), 103(5), part 2 abstract 93) discusses the stripping of glycosylation from 
human milk mucin, antibodies raised against the core protein and their use in probing a cDNA expression library. 

It has now been found that the HPEM core protein has epitopes which also appear in the aberrantly processed 
gps produced by adenocarcinoma cells. Certain of these epitopes are not exposed in the fully processed mucin glyc- 
oprotein produced by the lactating mammary gland. 

Antibodies have been made against the HPEM core protein that react with the core protein, especially as expressed 
by colon, lung, ovary and particularly breast carcinomas, but have reduced or no reaction with the corresponding fully 
processed HPEM, particularly as produced by the normal lactating human mammary gland. 

Such antibodies have no significant reaction with the mucin glycoproteins produced by pregnant or lactating mam- 
mary epithelial tissues but react with the mucin proteins expressed by mammary epithelial adenocarcinoma cells. 
These antibodies show a much reduced reaction with benign breast tumours and are therefore useful in the diagnosis 
and localisation of breast cancer as well as in therapeutic methods. 
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The antibodies may be used tor other purposes including screening cell cultures tor the polypeptide expression 
product ot the human mammary epithelial mucin gene, or fragments thereof, particularly the nascent expression prod- 
uct. In this case the antibodies may conveniently be polyclonal or monoclonal antibodies. 

These antibodies may be produced by innoculation ot suitable animals with HPEM core protein or a fragment 
5 thereof such as the peptides described below. Monoclonal antibodies are produced by the method of Kohler & Milstein 
(Nature 256 , 495-497/1975) by immortalising spleen cells from an animal innoculated with the mucin core protein or 
a fragment thereof, usually by fusion with an immortal cell line (preferably a myeloma cell line), of the same or a different 
species as the innoculated animal, followed by the appropriate cloning and screening steps. 

One such antibody is the monoclonal antibody designated SM3 against the HPEM core protein! Thehybridoma 
to cell line which secretes the antibody SM3 has been designated HSM3. Samples of HSM3 have been deposited with 
ECACC on 7th January 1987 under accession number 87010701. 

Using such antibodies it has been possible to screen a phage library constructed from mRNA isolated from a 
human breast cancer cell line to identify sequences coding for portions of the mucin core protein. Complementary DNA 
sequences have been constructed and from these it has surprisingly been found that the gene encoding the core 
ts protein contains multiple tandem repeats of a 60 base sequence leading to considerable polymorphism sufficiently 
extensive that cDNA fragments corresponding to the repeat sequence would be useful for fingerprinting DNA. 

A double tandem repeat has the following sequence: 

5' 

GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 

★ 

25 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT- 



^ GTC ACC TCG GCC CCG GAC ACC AGG CCG GCC- 

* 3' 

CCG GGC TCC ACC GCC CCC CCA GCC CAC GGT 

35 

When the existence of a tandem repeat sequence was first identified it was believed that the sequence originally 
defined in British Patent Application No. 8700269 consisted of 59 base pairs corresponding with the sequence indicated 
above except for the lack of the base indicated with 

40 The invention provides a polypeptide recognisable by the monoclonal antibody SM3 secreted by the hybridoma 

HSM3 (ECACC 87010701) comprising at least 5 consecutive amino acid residues in a sequence encoded by the DNA 
tandem repeat sequence of mRNA corresponding to the human polymorphic epithelial mucin core protein gene, which 
mRNA is obtainable from the human cancer cell line MCF-7 for use in a method of therapy practised on the human or 
animal body. The invention also provides the use of polypeptides as defined above in the production of a medicament 

45 for use as an immunogen in a method of cancer therapy practised on the human or animal body and the use of such 
polypeptides in jn vitro diagnosis. 

Polypeptides for use according to the invention are selected from any of those having 5 or more amino acid residues 
represented by the following amino acid sequence 

so 

Val Thr Ser Ala Pro Asp Thr Arg Pro Ala Pro Gly Ser Thr 
Ala Pro Pro Ala His Gly*Val Thr Ser Ala Pro Asp Thr Arg 

S5 Pro Ala Pro Gly Ser Thr Ala Pro Pro Ala His Gly 

(”*" marks the start of the repeat in the peptide) . 
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Polypeptides for use according to the invention may have a sequence corresponding with any portion of the 40 
residue sequence above and may include the start point of the repeat sequence. 

Other polypeptides for use according to the invention include three or more repeats of the 20 amino acid repeat 
sequence. Such polypeptides may include minor variations by way of substitution of individual amino acid residues. 

5 The invention further provides polypeptides as defined above modified by addition of N-acetyl galactosamine (a 

linkage sugar)- on serine and/or threonine residues and by addition of oligosaccharide moieties to that or via other 
linkage sugars and/or fragments linked to carrier proteins such as keyhole limpet haemocyanin, albumen or thyroglob- 
ulin. 

Preferably the polypeptide comprises at least 1 0 amino acid residues of the sequence above, more preferably 20 
io residues. The polypeptide may comprise the full sequence above. Such polypeptides may further comprise additional 
amino acid residues, preferably conforming to the amino acid sequence of HPEM core protein. 

The polypeptides for use according to the invention encoded by the tandem repeat may be produced by culturing 
appropriate cells transformed with a vector. A nucleic acid fragment encoding the polypeptide together with any nec- 
essary or desirable flanking sequences (for instance with further coding or non-coding nucleic acid sequences including 
is regulatory and promoter sequences, marker sequences and splicing or ligating sites) is inserted, in an appropriate 
open reading frame register, into a suitable vector such as a plasmid or a viral genome (forjnstance vaccinia virus 
genome) and is then expressed as a polypeptide product by conventional techniques. The polypeptide is then harvested 
and used as an immunogen to induce active immunity against the mucin core protein. Alternatively the vector, partic- 
ularly in the form of a virus, may be directly i rtnoculated into a human or animal to be immunised. The vector then 
20 directs expression of the polypeptide in vivo and this in turn serves as an immunogen to induce active immunity against 
the mucin core protein. 

The HPEM core protein is encoded by the PUM gene and may be produced by recombinant DN A techniques and 
expressed without glycosylation in human or non-human cells. Alternatively it may be obtained by stripping carbohy- 
drate from native human mucin glycoprotein which itself may be produced by isolation from samples of human tissue 
2 S or body fluids or by expression and full processing in a human cell line. The HPEM core protein may be used immunogen 
in active immunisation of humans. - 1 

The invention will now be illustrated by the following Examples and with reference to the figures of the accompa- 
nying drawings in which 

30 Figure Legends 

Figure 1 : Purification of the milk mucin by immunoaffinity chromatography using the antibody HMFG-1 . Milks from 
several individuals were combined and absorbed to a HMFG-1 -Sepharose column as described in methods. The ma- 
terial eluting at low pH was iodinated and subjected to PAGE electrophoresis and autoradiography (track 1). The iodi- 
35 nated material was precipitated using the Protein A method with antibodies HMFG-1 (track 5), HMFG-2 (track 2), ST254 
(track 3) and RPMI + 20% FCS (track 4). 

Figure 2: Comparison of the 125 l labelled purified milk mucin with immunoblot of human skimmed milk. A, human 
skimmed milk was subjected to SDS polyacrylmide electrophoresis, transferred to nitrocellulose paper, the blot probed 
with the monoclonal antibody HMPG-1 and binding detected using an ELISA method. B, after purification on an HMPG- 
40 i affinity column followed by G75 Sephadex chromatography the milk mucin was iodinated by the Bolton and Hunter 
method and subjected to SDS polyacrylamide electrophoresis and autoratiography. 

Figure 3: Autoradiography of the iodinated milk mucin after treatment with hydrogen fluoride. The purified milk 
mucin was treated with HF for 3 hours at room temperature (track 1) or 1 hour at 4°C (track 2) and the resulting 
preparations were then iodinated and run on SDS polyacrylamide gels. 

45 Figure 4: Reactivity of the intact, partially stripped or extensively stripped milk mucin with iodinated lectins. The 

purified intact milk mucin (track 1), the mucin treated with HF for 1 hour at 4°C (track 2) and the mucin treated for 3 
hours at room temperature (track 3) were subjected to SDS polyacrylamide electrophoresis and then transferred to 
nitrocellulose paper. The paper was then probed with 125 l PNA (peanut agglutinin), 125 l WGA (wheat germ agglutinin), 
or 125 l HPA (Helix pomatia agglutinin). 

so Figure 5: Immunoprecipitation and immunoblots of the partially and extensively stripped mucin. A, the 125 l exten- 

sively stripped mucin was immunoprecipitated with SM-3 (track 3), HMFG-2 (track 2) or NS2 medium as a control 
(track 1) by the protein A plate method (see Materials and methods). B, the partially stripped mucin (track 1 ) or exten- 
sively stripped mucin (track 2) was run on SDS polyacrylamide gels and transferred to nitrocellulose paper. The blot 
was then reacted with a cocktail of SM-3 and SM-4 monoclonal antibodies and the binding detected using an ELISA 
ss method. 

Figure 6: Reactivity of monoclonal anticodies SM-3 and HMFG-2 with methacarn fixed breast tissue and tumour 
sections using an indirect immunoperoxidase staining method. Infiltrating ductal carcinoma showing strong reactivity 
with both SM-3 (A) and HMPG-2 (B). Fibroadenoma showing no reactivity with SM-3 (C) and strong heterogeneous 
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staining of the epithelium with HMFG-2 (D). Papilloma showing very weak reactivity with SM-3 (E) and strong positivity 
with HMPG-2 (F). Both normal resting breast (G) and lactating breast (I) were negative when stained with SM-3, where- 
as both tissues stained positively with HMPG-2 with lactating breast (J) much stronger than normal resting breast (H). 

Figure 7 . Periodic acid-silver stained milk mucin after antibody affinity column and gel filtration column. Milk mucin 
5 was purified on an HMFG-1 antibody affinity column (lane 1) followed by passage through a G75 Sephadex column 
(lane 2), subjected to NaDodS0 4 /poly acrylamide gel electrophoresis, and silver stained following treatment of gels 
with 0.2% periodic acid. 

Figure 8 . Silver stain of partially and totally stripped core protein from milk mucin. The purified milk mucin was 
; deglycosylated by treatment with anhydrous hydrogen fluoride for 1 hr at 0°C (lane 1) and 3 hr at room temperature 
io (lane 2), separated by electrophoresis through a NaDodS0 4 /polyacrylamide gel (10%) and silver stained. 

Figure 9. Immunoprecipitation with Mabs of in vitro translated protein products from MCF-7 poly(A) + RNA. Poly 
(A) + RNA from MCF-7 cells was translated in a rabbit reticulocyte lysate system (Amersham) in the presence of [ 35 S] 
methionine (1000 Ci/mmole; ICi = 37 GBq) following the manufacturer's conditions. Samples containing 5 x 10 4 acid 
precipitable cpm were precipitated with MAbs SM-4 (lane a), SM-3 (lane b), HMFG-2 (lane c), HMFG-1 (lane d) and 
15 an irrelevant MAb to interferon (lane e, 24), separated on a NaDodS0 4 /polyacrylamide gel (10%), impregnated with 
Amplify and exposed to XAR-5 film at -70°C for 20 days. 

Figure 10 . Immunoblot analysis of fusion proteins from the \muc clones. The phage clones XMUC 3,4, 6,7, 8,9 and 
10 were used to lysogenize bacterial strain Y 1089. Lysogens were grown at 32°C, shifted to 42°C, and then induced 
with IPTG. Lysogen proteins were fractionated by electrophoresis through a NaDodSCtypolyacrylamide gel (7.5%), 
20 transferred to nitrocellulose, and reached with HMFG-2. The binding was detected with an ELISA method using 4-chlo- 
ro-1 -naphthol as the substrate. The numbers are those of the X clones. 

Figure 11 . Hybridization of pMUCIO to cDNA inserts of pMUC clones. DNA from the plasmid clones was digested 
with restriction enzyme EcoRI to excise the cDNA inserts, separated by electrophoresis on 1 .4% agarose and trans- 
ferred to Biodyne nylon membrane. The filter was hybridized using standard conditions (34) to the insert from pMUCIO 
25 which was labelled with [a- 32 p]dCTP by the method of random priming (41). Lanes: plasmid clones 3,4.6,7,8,9,10. 

Figure 12 . RNA blot hybridization analysis of mammary breast mucin mRNA. 1 0 pg of total RNA from human breast 
cancer cells MCF-7 (lane 1) and T47D (lane 2), normal human mammary epithelial cells HuME (lane 3), human em- 
bryonic fibroblasts ICRF 23 (lane 4), Daudi cells (lane 5) and carcinosarcoma HS578T cells (lane 6) were separated 
in a 1.3% agarose/glyoxal gel, blotted on to nitrocellulose and hybridized to the pMUCIO EcoRI insert which was 
30 labelled with [a- 32 p]dCTP by the method of random priming (41). The size markers were28S (5.4 kb) and 18S (2.1 kb) 
rRNAs. 

Figure 13 . Polymorphic human DNA fragments detected by hybridization with pMUCIO probe. Genomic DNA 
samples prepared from the white blood cells from ten individuals (six unrelated) and from three cell lines were digested 
to completion with Hinfl and EcoRI, fractionated by electrphoresis through 0.7% and 0.6% agarose, respectively, and 
35 transferred to Biodyne nylon membranes. The filter was hybridized to the pMUCI 0 DNA insert which was labelled with 
[a 32 pjdCTP by the method of random priming (41 ). X-ray film was exposed for 1 day at -70°C with intensifying screens. 
Lanes 1-4 father, two daughters and mother, lanes 5-10 unrelated individuals, lane 11 is MCF-7, lane 12 is ZR75-1, 
lane 13 is ICRF-23. The DNA samples exhibit a wide distribution of sizes. Numbers indicate length of DNA in kb. The 
apparent bands at 23Kb are in lanes 12 and 13 are artefacts introduced in autoradiography. 

40 

Example 1 

Purification of the milk mucin 

45 The milk mucin was purified from human skimmed milk by passage through an HMFG-1 affinity column followed 

by size exclusion chromatography. The HMFG-1 monoclonal antibody was purified from tissue culture supernatant 
using a protein A column (1 ). The purified antibody was coupled to cyanogen bromide activated sepharose (Pharmacia) 
as described in the manufacturer's instructions. Human skimmed milk was passed in batches of 100 ml through the 
antibody column followed by extensive washing with PBS. Bound antigen was eluted from the column using 0.1 M 
so glycine pH 2.5 and the fractions registering an optical density at 280nm were pooled, dialysed against 0.25 M acetic 
acid and lyophilized. Batches of about 20 mgs were dissolved in 0.25 M acetic acid and passed through a G75 Sephadex 
column (1 x 100 cm) which had been previously equiligrated with acetic acid. The column was washed with 0.25 M 
acetic acid and 1 ml fractions collected. The peak fractions which were eluted in the void volume were pooled, lyophilized 
and the dry powder stored at 4°C. Amino acid analysis was performed using a Beckman 6300 amino acid analyser. 

55 

Deqlycosylation of the milk mucin 

To remove the O-linked carbohydrate from the milk mucin the molecule was treated with anhydrous hydrogen 
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fluoride as described by Mort and Lamport (21 ), for either 1 hour at 4°C which produced a partially stripped preparation, 
or 3 hours at room temperature which produced the extensively stripped mucin. 

lodination of the milk mucin 
5 

lodinations of the purified mucin, the partially or extensively stripped mucin were carried out using the Bolton and 
Hunter method (51 ). Briefly, the mucin, 2.5 pg in 20 pi 0.1 M borate buffer pH 8.5, was added to the dried Bolton and 
Hunter reagent (1 mCi, Amersham International pic) and incubated at room temperature for 15 minutes. The reaction 
was stopped by the addition of 0.5 ml of 0.2M glycine in borate buffer and after a further 15 minutes incubation, free 
io Bolton and Hunter reagent was removed by passage through a G25 Sephadex column (PD10 columns, Pharmacia) 
previously equilibriated in PBS. 

lodination of Lectins 

15 Wheat germ agglutinin (WGA), peanut agglutinin (PNA) (Vector Labs) and Helix pomatia agglutinin (HPA) (Boe- 

hringer) were iodinated as described by Karlsson et al. (52) using the chloramine T method. 

Polyacrylamide gels and Western blots 

20 Polyacrylamide gel electrophoresis and immunoblotting was performed as described previously (1). Briefly, sam- 

ples were run on 5-1 5% polyacrylamide gels and then electrophoretically transferred to nitrocellulose paper (Schleicher 
and Schuell) at 50 volts overnight at 4°C (36). In the immunoblotting experiments the paper was reacted with monoclonal 
antibodies and binding detected with an ELISA method using 4-chloro-1-naphthoJ as the substrate. For lectin binding 
studies the Western blots were reacted with the iodinated lectins as described by Swallow et al. (48). 

25 

Production of monoclonal antibodies 

A female BALB/c mouse was immunized with 5 pg of the partially stripped milk mucin in Freund's complete adjuvant 
and 3 months later boosted with a further 5 pg of the same preparation in Freund’s incomplete adjuvant. After a further 
30 20 days, 5 pg of the mucin extensively stripped of its carbohydrate was given intravenously in saline solution. The 

spleen was removed 4 days later, and fused with the NS2 mouse myeloma cell line (53). 

Screening of hybridoma supernatant and immunoprecipitations 

35 The screening assay was a modification of that described by Melero and Gonzalez-Rodriguez (54). Multiwell plates 

were coated with 50 pi of 0.1 mg/mf protein A (Pharmacia Fine Chemicals) in PBS and allowed to dry overnight at 
37°C. The plates were blocked with 5% BSA for 1 hour at 37°C followed by the addition of 50 pi of rabbit anti-mouse 
immunoglobulin (DAKO, diluted 1:10 in PBS/BSA = PBS/BSA). After incubating for 2 hours at 37°C the plates were 
washed twice with PBS containing 1% BSA and 50 pi of hybridoma supernatant added. The plates were incubated 
40 overnight at 4°C, washed twice with PBS/BSA and 50 pi of iodinated partially stripped mucin containing 1 00,000 cpm 
added to each well. The plates were then incubated at room temperature for 4 hours, washed 4 times with PBS/BSA 
and the individual wells counted in a gamma counter. For immunoprecipitation experiments 50 pi of SDS sample buffer 
containing dithiothreitol was added to each of the wells which were then boiled for 3 minutes and the buffer loaded 
onto 5-15% polyacrylamide gradient gels. 

45 

Staining of tissue sections 

Tissues from primary mammary carcinomas, benign breast biopsies, normal breast, and pregnant lactating breast 
tissue were fixed in methacarn (methanol chloroform and acetic acid 60:30:10) and embedded into paraffin wax. Sec- 
50 tions were stained with the antibodies using the indirect peroxidase anti peroxidase method as previously described 
(47).. 

Results 

55 Purification of the milk mucin 



The milk mucin was purified from human skimmed milk on an HMFG-1 antibody affinity column, lodination of the 
eluted material revealed the presence of a large molecular weight component and a 68KD band. Precipitation of the 
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affinity purified material with antibodies HMFG-1 and HMFG-2 (tracks 2 and 5) followed by gel electrophoresis showed 
that both the high molecular weight components and the 68KD component were precipitated by both antibodies (less 
effectively by HMFG-2). Since the 68KD component was also precipitated by two unrelated antibodies (figure 1 , tracks 
3 and 4) and this component was not evident on an immunoblot of the purified material reacted with HMFG-1 (figure 
5 2A), the 68K component was removed by molecular sieve chromatography on a G75 column. The final purified product 

showed a major high molecular weight band, with only a trace of the 68K component and a minor contaminant around 
14K (figure 2B). 

A high molecular weight glycoprotein (PAS-O) containing more than 50% carbohydrate in O-linkage has been 
purified from the human milk fat globule by Shimizu and Yamauchi (8). To see whether this component was similar to ' 
io the mucin isolated from milk by affinity chromatography on an HMFG-1 affinity column, the amino acid composition of 
the purified HMFG-1 reactive mucin was determined and compared to the amino acid composition of the purified PAS- 
O component. Table 1 shows that there is good correspondence between the two sets of data, indicating that the core 
proteins of PAS-0 and the mucin purified here are the same. 

75 Isolation of the core protein of the milk mucin 

As there are no enzymes easily available that are efficient at removing O-linked sugars, and {$ elimination often 
results in damage to the protein core, the oligosaccharides were removed by treatment of the mucin with anhydrous 
hydrogen fluoride. This treatment has been shown by Mort and Lamport (21) to be effective in removing sugars from 
20 pig submaxillary mucin without damaging the protein core. Amino acid analysis of the material produced after HF 
treatment of the milk mucin suggested that the protein core was also in this case undamaged, since the composition 
was the same as that seen in the intact mucin (Table 1 ). 

Initially the milk mucin was exposed to HF for only 1 hour at 4°, but analysis of the product showed that there was 
only partial removal of the sugars with such treatment, and it was necessary to treat the mucin at room temperature 
25 for 3 hours to obtain a molecule which showed no lectin binding ability. Figure 3 shows an autoradiograph of the 
iodinated products after treatment for 1 hour at 4° (track 2) or 3 hours at FIT (track 1 ). It can be seen from Figure 3 that 
the milder treatment results in a mixture of products made up of high molecular weight material which is slightly smaller 
than the intact mucin and a number of smaller bands. After longer exposure to HF at room temperature, the high 
molecular weight bands disappeared resulting in polypeptide bands of about 68KD and 72KD. 

30 To test for the presence of sugars on the intact mucin and on the products produced after the two different HF 

treatments each preparation was subjected to acrylamide gel electrophoresis, transferred to nitrocellulose paper and 
reacted with 125 l-labelled lectins. The lectins used were peanut lectin (PNA) which reacts with galactose linked to N- 
acetyl galactosamine, wheat germ (WGA) reactive with N-acetyl glucosamine and Helix pomatia agglutinin (HPA) which 
reacts with the linkage sugar in O-linked glycosylation, N-acetylgalactosamine. Figure 4 shows autoradiographs of the 
35 reacted blots, and it can be seen that while treatment with HF for 1 hr at 4 a (track 2) alters the lectin reactivity of the 
mucin, carbohydrate is still present. Interestingly, however, there is a much lower level of binding of PNA to the high 
molecular weight material of the partially stripped mucin than is seen with the intact mucin (track 1). Moreover, this 
loss in PNA binding ability is accompanied by binding of the linkage sugar specific lectin HPA. This lectin shows no 
binding at all to the intact mucin, and the changed pattern of lectin binding after limited treatment with HF indicates 
40 that sugars masking the O-linked N-acetylgalactosamine have been stripped off. The smaller component seen in both . 
the intact mucin (track 1) and in the partially stripped preparation (track 2) is a glycoprotein which reacts with WGA, 
although not with PNA. This may correspond to the component of similar molecular weight (around 68K) seen after 
affinity chromatography of the mucin and may represent an intermediate precursor molecule. 

Figure 4 shows clearly that the 68K and 72K components produced after extensive treatment with HF (3 hr at RT), 
45 show no reactivity with the lectins (track 3), including the N-acetylgalactosamine specific lectin HPA. This observation 
constitutes strong evidence that the sugars have been removed from at least the majority of the molecules, and we 
will refer to this preparation as the extensively stripped mucin. 

Generation of monoclonal antibodies to the milk mucin core protein 
so 

A fusion was carried out using the spleen of a mouse that had been immunized with two injections of the partially 
stripped milk mucin followed by a boost with the extensively stripped mucin. The clones were initially screened against- 
. the 125 l partially stripped material using protein A plates (see Methods). Four hybridomas were selected and cloned, 
and table 2 shows their spectrum of reactivity with the intact, partially and extensively stripped mucin. As can be seen 
55 from this table three of the hybridomas which were isolated showed a strong reaction with the partially and extensively 
stripped mucin and did not react with the intact mucin. There appeared to be good candidates for monoclonal antibodies 
to the protein core and two, SM-3 and SM-4, were selected to be characterised further. 

It can also be seen from table 2 that the HMFG-1 and HMFG-2 antibodies reacted very strongly with the mucin 
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stripped of its carbohydrate. These two antibodies were, in fact, developed using the intact mucin (from the milk fat 
globule) as immunogen and, in the case of HMFG-2, growing mammary epithelial cells (14). Their reaction with the 
stripped mucin was unexpected, as circumstantial evidence had previously led to the belief that carbohydrate might 
form at least part of their antigenic epitopes. 

5 

Molecular Weight of molecules carrying antigenic determinants 

The antibody SM-3 was shown to be of the IgGI subclass, while the SM-4 antibody was found to be IgM. We 
therefore chose to use the SM-3 antibody in subsequent experiments since antibodies of the IgM class can present 
io problems in some appliction. Immunoprecipitation of the extensively stripped material with SM-3 showed a reaction 
with the lectin unreactive 68K component (Figure 5A, track 3). The monoclonal antibody HMFG-2 can also be seen to 
immune precipitate the lectin-unreactive 68K component (track 2\ The antibodies were reactive with antigen on im- 
munoblots and Figure 5B shows the reaction of antibody SM-3 with the dominant 68K band of the extensively stripped 
mucin (track 2). 

is We have previously shown that the molecular weight of the components in breast cancer cells carrying determi- 

nants found on the milk mucin is lower than 400K and can vary from one tumour to another (1 ). Reaction of antibody 
SM-3 with Western blots of gel separated extracts of breast tumour cells shows that this antibody reacts with compo- 
nents of similar molecular weight to those reactive with antibody HMFG-2 (data not shown). Because the antibody SM- 
3 differs from the antibodies HMFG-1 and 2 in that it does not react with the intact mucin processed by the lactating 
20 gland and yet reacts with molecules processed by breast cancer cells, it was appropriate to examine the reaction of 
SM-3 with a range of breast cancers. 

Reactivity of SM-3 with breast tissues and tumours 

25 The antibody SM-3 reacted with paraffin embedded tissues provided these were fixed in methacarn (not formal 

saline). Using this method for preparation of tissue sections, the reaction of the antibody was compared to that of 
HMFG-2 on breast tissues and tumours with an indirect immunoperoxidase staining method. This analysis showed a 
dramatic difference in the staining pattern of SM-3 compared to that seen with, HMFG-2. Thus, although a strong 
positive reaction was seen in 20/22 breast cancers stained with SM-3 (as compared to 22/22 stained with HMFG-2), 
20 normal resting breast, pregnant or lactating tissues and most benign lesions were largely unstained with SM-3 but 
were stained with HMFG-2. Some examples of staining patterns of breast tissues and tumours are illustrted in Figure 6. 

Twenty-two primary carcinomas and fourteen benign lesions were examined and the reaction of SM-3 compared 
to the staining with HMFG-2 in each case, in the primary carcinomas, staining with SM-3 was heterogeneous but 
generally quite strong and always confined to tumour cells; connective tissue and stroma showed no reaction (see 
35 figures 6A,B). In the four fibroadenomas examined, staining of the epithelium with HMFG-2 was strong although het- 
erogeneous. In comparison, staining with SM-3 was negative in one case and in the three others staining was confined 
to only one or two glandular elements. HMFG-2 showed strong positivity on the five papillomas and five cases of cystic 
disease studied while the staining observed with SM-3 was very such weaker and more heterogeneous (figures 6G, 
H). The papillomas as a group showed the strongest staining with SM-3, and it can be seen that the staining was 
40 membranous or extracellular. 

In contrast to HMFG-1 and HMFG-2 which strongly stain lactating and pregnant breast, SM-3 was totally negative 
with three out of six cases of pregnant or lactating breast (see figure 6C and D). Two positive cases showed only very 
weak staining of an occasional cell and in the third, staining was confined to two areas of one lobule. Again, in contrast 
to HMFG-1 and HMFG-2 which do react with some terminal ductal lobular units of normal, resting breast (albeit weakly), 
45 SM-3 was totally negative on eight out of the thirteen cases tested and in the other five cases staining was extremely 
weak and often confined to one or two acini in the tissue section (see figure 6E and F). It should perhaps be noted that 
the intensity of staining with HMFG-2 seen with normal breast tissues and benign lesions fixed in methacarn was 
somewhat higher than that reported previously using formalin fixed material (50,47). 

SM-3 was also shown to be negative on sections of normal liver, lung, thymus, sweat gland, epididymus, prostate, 
so bladder, small intestine, large intestine, appendix, thyroid and skin. The antibody showed weak positive staining only 
with the distal tubules of the kidney, the occasional chief cell of the stomach, the occasional duct cell of the salivary 
gland and the sebaceous gland. 

Discussion 

55 

Large molecular weight mucin molecules are expressed by many carcinomas and carry many of the tumour as- 
sociated antigenic determinants recognised by monoclonal antibodies. These epitopes may also be expressed by 
some normal epithelium, and some monoclonal antibodies like HMFG-1 react particularly well with a mucin found in 
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normal human milk (1,17). As long as the study of the mucins is restricted to their detection with antibodies reactive 
with undefined epitopes, the knowledge of their structure, expression and processing will also be restricted. We have 
begun to investigate the structure and expression of the mammary mucin by isolating the core protein and developing 
antibodies which have allowed us to select partial cDNA clones for the gene coding for the core protein . This Example 
s describes the production and characterization of these antibodies. 

Treatment of the HMFG-1 affinity purified milk mucin with hydrogen fluoride resufted in the appearance of a dom- 
inant band of about 68K daltons and a minor species of about 72KD on SDS acrylamide gets. These bands showed 
no reactivity with lectins, including Helix pomatia agglutinin which is specific for N-acetyl galactosamine, the first sugar 
in O-linked glycosylation (55). It therefore seems probable that this 68K datton polypeptide represents the core protein 
io of the mucin. Supportive evidence for this comes from the observation that the antibodies described here, which are 
reactive with the stripped 68K component, can precipitate a molecule of this size from the in vitro translation products 
of mRNA isolated from breast cancer cells expressing the mucin . 

As the milk mucin contains at least 50% carbohydrate (16), a protein core of only 68KD appears too small if the 
intact molecule has an observed molecular weight greater than 400KD. However, mucins can be composed of small 
is subunits which aggregate and are held together by some form of non-covalent interactions, as yet not understood. For 
example, although the molecular weight of the ovine submaxillary mucin has been reported to be greater than 1 x 10 6 
daltons (45), it has a protein core of only 650 amino acids with a molecular weight of 58,300 daltons (46). 

An unexpected finding was that the antibodies HMFG-1 and HMFG-2 which react with the milk mucin, also show 
a positive reaction with the extensively stripped material which showed no lectin binding capability. Previous indirect 
20 evidence, including the resistance to fixation, boiling and reduction, the repetitive nature of their epitopes and the 
appearance of several bands on immunoblots, had led to the belief that carbohydrate present on the milk mucin was 
involved in these epitopes. This idea was reinforced by the observation that lectins could block the binding of HMFG- 
1 and 2 (1). While it is not possible to exclude the possibility that some sugars, undetected by the lectin binding exper- 
iments, remain on the extensively stripped mucin described here, this is unlikely to be the explanation for the reactivity 
25 of the antibodies HMFG-1 and 2. This can be said since both antibodies have recently been shown to react positively 
with (i-galactosidase fusion proteins expressed by phage carrying DNA coding for the core protein of the mammary 
mucin. It appears therefore that at least part of each of the epitopes recognised by HMFG-1 and HMFG-2 contain 
amino acids but it must be assumed that some of these epitopes on the core protein are exposed, i.e. not masked in 
the fully glycosylated molecule. The HMFG-2 epitope is however less abundant on the milk mucin than the HMFG-1 
30 epitope, while it is readily detectable on the mucin molecules expressed by tumours (1). These molecules have a 

smaller molecular weight and may be less heavily glycosylated or polymerized. 

Here we have reported the development of new antibodies which are reactive with the protein core of the mucin 
and with the partially deglycosylated molecule, but which are unreactive with the fully processed mucin produced by 
the lactating mammary gland. One of these antibodies SM-3, which is an IgGI , has been studied in more detail. It has 
35 been shown to react with the mucin molecules which are produced by breast cancer cells and are recognised by many 

antibodies developed against the intact milk mucin. It should be amphasized however that the epitope recognised by 
SM-3 which is on the core protein and is exposed in the mucin as processed by tumour cells, is not exposed on the 
normally processed milk mucin. This feature offers the possibility of enhanced tumour specificity, and a pilot immuno- 
histochemical study of breast tumours and tissues has shown that indeed the SM-3 antibody reacts strongly with the 
40 majority of primary breast cancers (91 %) but shows little or no reaction with benign breast tumours, resting or lactating 
breast, and most normal tissues. 

There are several implications of the work described here which may be important for both basic and clinical studies 
in breast cancer. The observation that parts of the core protein (detectable by antibodies) are exposed on the mucins 
as processed by breast cancer but masked on the mucin as processed by cells in normal breast and benign tumours 
45 implies that there is an alteration in the processing of the mucin in malignancy. A more detailed study of the processing 
of the mucin in normal and malignant cells may then give basic information for defining the malignant cell. Moreover, 
since the specificity of the reaction of the antibody SM-3 for tumours is better than that of antibodies developed against 
the intact mucin, this antibody may prove to be a more effective diagnostic tool for the detection of breast cancer cells 
in tissue sections, tissue fluids and cells. The reactive components are membrane associated as well as intracellular 
50 and in vivo localisation of tumours may also be possible. 

Abbreviations 

The abbreviations used are : HMFG, human milk fat globule; PBS, phosphate-buffered saline (153 mM NaCI, 3 
55 mM KCI, 10 mM Na 2 HP0 4 , 2 mM KM 2 P0 4 pH 7.4); WGA, wheat germ agglutinin; PNA, peanut agglutinin; HPA, Helix 
pomatia agglutinin; BSA, bovine serum albumin; SDS, sodium dodecyl sulfate. 
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Example 2 

Purification and deglycosylation of human milk mucin was conducted as in Example 1 mucin was purified on an 
HMFG-1 antibody. 

5 The striped mucin preparations were separated by electrophoresis through NaDodS0 4 /polyacrylamide gels (1 0%) 

and silver stained by two methods, one of which can be used to stain highly glycosylated proteins (22,23). 

Preparation of polyclonal rabbit antiserum to stripped core protein 

to One New Zealand White rabbit was immunized with 100 pg of the partially stripped core protein in complete Fre- 

und's adjuvant (Gibco). Booster injections of 500 pg of the totally stripped core protein were administered in incomplete 
Freund’s adjuvant (Gibco) 3 and 4 weeks after the initial injection and the rabbit was bled one week later. Ten microliters 
of immune serum (75 pg/ml protein) precipitated 200 ng of fully stripped core protein in a Protein A assay (24) and 
detected it on immunoblots. The immunoglobulin fractions of rabbit preimmune and rabbit anti-mucin core protein were 
is prepared by adding ammonium sulfate to 50% saturation. The resulting pellet was resuspended in one-half the original 
serum volume of PBS and dialyzed against the same buffer. After dialysis, only residual precipitate was removed by 
centrifugation. Immunoglobulin fractions were stored in aliquots at -20° C. 

Description of MAbs used 

20 

In addition to the polyclonal antiserum used for initial screening, a cocktail of two MAbs, SM-3 and SM-4 (see 
Example 1) which recognise the mucin core protein (20) and HMFG-1 and HMFG-2 (1,14) were used to screen the 
purified plaques, the p-galactosidase fusion proteins and immunoprecipitations from in vitro translated proteins*. Other 
MAbs used were a monoclonal anti-p-galactosidase antibody (25) which was a gift from H. Durbin (ICRF, London), an 
25 anti -interferon antibody, ST254 (24), LE61 a keratin antibody (26) and Ml B which recognizes a carbohydrate structure 

on the milk mucin (27). 

In Vitro translation of proteins 

30 RNA was isolated from the human breast cancer cell line MCF-7 using the guanidium isothiocyanate method of 

Chirgwin et at. (28) and poly(A) + RNA was purified by chromatography using oligo (dT)-cellulose (New England Bio 
Labs). The poiy(A) + NRNA was translated in a reticulocyte lysate system (Amersham) in the presence of p 5 S] methio- 
nine (1000 Ci/mmo!e; 1 Ci = 37 GBq, Amersham). Samples containing 5 x 10 4 acid insoluble cpm were precipitated 
in a protein A assay (24) using MAbs SM-3, SM-4, HMFG-1 , HMFG-2 and a control antibody to human interferon. The 
35 antibody-selected proteins were then separated on a 10% NaDodS0 4 /poly acrylamide gel, impregnated with Amplify 
(Amersham) and exposed to XAR-5 film (Kodak) at -70°0. 

Antibody screening of A,qt11 library and protein blotting 

40 The Xgtll library used In this study was constructed from mRNA isolated from the human breast cancer cell line 
MCF-7 and was generously provided by Philippe Walter and Pierre Chambon (Strasbourg, France). The poly (A) + RNA 
used for the preparation of the randomly primed library was prepared from mRNA that sedimented faster than 28S 
rRNA and was enriched in estrogen receptor (29). The library was made essentially as described by Huynh et al. and 
Young and Davis (30-32) and contained approximately 1 x 10 6 recombinants per pg of RNA. Between 85% and 95% 
45 of the plaques contained inserts. 

The phage library was plated onto bacterial strain Y1090 and grown for 3 hr at 42°C. After isopropyl p-D-thioga- 
lactoside (IPTG) induction and 3 hr of growth at 37°C, filters were prepared from each plate and screened with anti- 
mucin core protein antibody by the method of Young and Davis (32). The first antibody used in screening was the rabbit 
antiserum raised against the strippedcore protein prepared as described above. Priortouse in screening, the antiserum 
so was diluted 1:200 in PBS containing 1% bovine serum albumin (PBS/BSA). Preabsorption with Y1090 bacterial lysate 
was not found to be necessary. The nitrocellulose filters (Schleicher and Schuell) were blocked by incubation in PBS 
containing 5% BSAfor 1 hr at room temperature with gentle agitation. The filters were incubated at room temperature 
overnight with a 1 :200 dilution of antiserum in heat sealed plastic bags. The filters were washed 5x5 min in PBS/BSA, 
and bound antibody was detected by using horseradish peroxidase-conjugated sheep anti-rabbit antiserum (Dako) 
55 diluted 1 :500 with PBS/BSA and incubated for 2 hr with the filters. The filters were washed 5x5 min in PBS/BSA and 
1 x 10 min in PBS before color detection using 4-chloro-1-naphthol (1). Immunoreactive bacteriophage were picked 

*The MAbs SM-3 and SM-4 (SM refers to stripped mucin) show strong reactivity with the partially and fully stripped core protein but no reactivity with 
the fully glycosylated mucin (20). 
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and purified through two additional rounds of screening. Subsequently, bacteriophage inserts were subcloned into the 
EcoRI sites of pUC8 (33) producing the plasmid used most extensively, pMUC f 0. The plasmids were maintained in 
DH1 ceils. 

To examine the p-galactoridase-cDNA fusion proteins for immunoreactivity, cell lysates were derived. Lysogens 
5 were prepared as described in Young and Davis (34). Cells were pelleted, suspended in Laemmli sample buffer (35) 
and separated by electrophoresis through NaDOdSO 4 /p0lyacrylamide gels (10%) and transferred onto nitrocellulose 
filters as described (1,36). The filters were treated as above for antibody screening . 

Northern Analysis 

10 

RN A was isolated from tissue culture cells and frozen tissues by the guanidinium isothiocyanate method of Chirgwin 
et al. (28). Total RNA (10 pg per lane) was denatured by heating at 55°C for 1 hr in deionized glyoxal and fractionated 
by electrophoresis through a 1.3% glyoxal gel (38). The RNA was transferred to nitrocellulose (Schleicher and Schuell), 
prehybridized and hybridized as described by Thomas (34). Filters were washed down to 0.1X SSC with 0.1% SDS at 
is 65°C and exposed to XAR-5 film (Kodak) at -70°C with intensifying screens. 

Southern analysis 

High molecular weight genomic DNA was prepared from white blood cells and cell lines (39,40). These genomic 
20 DNAs (lOpg) were cleaved with restriction enzymes following the manufacturer's recommended conditions and frac- 
tioned through 0.6% and 0.7% agarose gels. Cloned plasmid DNA was cleaved and fractionated on 1.3% agarose. 
The gels were denatured, neutralized and transferred to nylon membranes (Biodyne) according to the manufacturer’s 
instructions. The EcoRi insert from pMUClOwas separated on a 1% low melting point agarose (Biorad) gel and labelled 
with [a- 32 P]dCTP by the method of random priming (41) and hybridized to filters at 42°C. Filters were washed down 
25 to 0.1 X SSC with 0.1% SDS at 55° C and exposed to XAR-5 film (Kodak) at -70° C with intensifying screens. 

Results 

Purification and deqlycosylation of mucin glycoprotein 

30 

Mucin glycoprotein reactive with the monoclonal antibody HMFG-1 was prepared from pooled human breast milk 
by using an HMFG-1 antibody affinity column, followed by molecular sieve chromatography on Sephadex G-75 in order 
to remove lower molecular weight components (Figure 7, lane 1). In order to demonstrate the homogeneity of the 
purified molecule, amino acid analyses of four separate preparations were performed and revealed a fairly consistent 
35 composition with serine, threonine, proline, alanine and glycine accounting for 58% of the amino acids. Periodic acid 
silver stained gels revealed a diffuse band of greater than 400,000 daltons visible only when the gel was treated with 
periodic acid before the silver stain (Fig. 7, lane 2). No other lower molecular weight bands were visualized on the gel 
using the silver stain without prior treatment with periodic acid. 

The purified material was subjected to treatment with hydrogen fluoride to remove the O-linked sugars that are 
to characteristic of mucin glycoproteins. Two different reaction conditions were used which resulted in a partially degly- 
cosylated core protein (treated at 0°C for 1 hr) and a fully deglycosylated core protein (treated at room temperature 
for 3 hr) as determined by iodinated lectin binding following separation by gel electrophoresis and transfer to nitrocel- 
lulose paper (20). The partially deglycosylated core protein was reactive with wheat germ agglutinin, peanut agglutinin 
and helix pommatia lectin (which recognizes the linkage sugar N-acetylgalactosamine) whereas the fully stripped pro- 
45 tein showed no reactivity with any of these three lectins. 

The hydrogen fluoride treated core protein was separated by electrophoresis through NaDOdS0 4 /polyacrylamide 
gels (10%) and silver stained. Silver staining revealed that the predominant component of the partially stripped mucin 
was a high molecular weight band of about 400 kd, although faint bands of lower molecular weight could also be 
observed (Fig. 8, lane 1). Since the high molecular weight material showed a somewhat increased mobility in the gel 
50 and reacted with the lectin recognising the linkage sugar, it can be assumed that some sugars had been removed. The 
fully stripped mucin consisted of two bands of about 68 kd and 72 kd (Fig. 8, lane 2). 



Antibody reactive proteins produced by MCF-7 cells 

The MCF-7 breast cancer cell line expresses large amounts of HMFG-1 and -2 reactive material on its cell surface 
(14) and was thus judged to be a suitable source of mRNA for a cDNA library. Before proceeding to screen the MCF- 
7 library with the monoclonal antibodies, they were tested for their ability to precipitate a component from in vitro 
translation products produced from MCF-7 mRNA. Poly (A) + RNA from MCF-7 was prepared and translated in vitro. 
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Proteins from the translation reaction were immunoprecipitated using the monoclonal antibodies HMFG-1 , HMFG-2, 
SM-3 and SM-4 and displayed by polyacrylamide gel electrophoresis and fluorography (Fig. 9). Two proteins of about 
68 kd and 92 kd were immunoprecipitated by SM-3 (lane 2) and SM-4 (lane 1). It was also found that HMFG-1 (lane 
4) and HMFG-2 (lane 3) immunoprecipitated these proteins; however, no bands in these areas were precipitated by 
5 an irrelevant monoclonal antibody to human interferon (lane 5). The fact that HMFG-1 and -2 immunoprecipitated these 
proteins was an unexpected finding as it was previously thought that these MAbs recognize carbohydrate determinants 
(1). However, we also found that HMFG-1 and -2 react very strongly with the fully stripped, iodinated core protein (20). 
These results together with the MAb reactions on the p-galactosidase fusion proteins (see below) confirm that the 
epitopes for HMFG-1 and -2 are, at least in part, protein in nature. ' ' ' - - - 

10 The abundance of the core protein mRNA in total cellular poly (A) + RNA was 4% as estimated by comparing the 

amount of ( 35 S)methionine present as immunoprecipitated protein to the amount of methionine incorporated into total 
protein during in vitro translation. 

Screening of the cDNA library 
is 

The X,gt11 cDNA library made from size selected MCF-7 mRNA (see Methods) was screened initially with the 
polyclonal antiserum made to the mucin core protein which had been stripped of its carbohydrate. Screening of 2 x 
10 6 plaques resulted in 11 positive clones, 7 of which were taken successfully through two further rounds of plaque 
purification. 

20 To demonstrate that the reactivity of the phage clones with the antibody probes was due to antigenic determinants 

on the cDNA translation product, p-galactosidase fusion proteins were made from all 7 clones. The proteins were 
separated by electrophoresis, transferred to nitrocellulose paper and probed with a variety of antibodies to the stripped 
mucin, including the polyclonal antiserum which was used initially to select the clones and a cocktail of SM-3 and SM- 
4. In addition, HMFG-1 and HMFG-2, the two monoclonal antibodies which originally detected this differentiation and 
25 tumour-associated epithelial mucin (1,14) were tested. All 7 clones yielded fusion proteins which were specifically 
recognized by the polyclonal antiserum, the monoclonal cocktail, and HMFG-2. HMFG-1 antibody reacted with 6 of 
the 7 fusion proteins and failed to recognize the protein from clone 9 which contains the smallest insert. In every case 
the strongest signal was given by the HMFG-2 antibody and this reaction is shown in Figure 10. Monoclonal antibodies 
to keratins and to a carbohydrate epitope on this fully glycosylated mucin were used as controls and showed no reactivity 
30 a monoclonal antibody to p-galactosidase was a positive control and the band recognized correlated in every case 
with the band recognized by the specific antibodies. The sizes of the fusion proteins varied in proportion to the sizes 
of the cDNA inserts found in the bacteriophage. 

Characterization of cDNAs and RNA blot analysis 

35 

The inserts from the X clones were designated pMUC3-1 0 (omitting pMUC5) and were subcloned into the vector 
pUC 8 for easier manipulation. The 7 clones were compared to each other for sequence homology. Each of the plasmids 
was digested with EcoRI and the insert separated on a 1 .4% agarose gel. The largest cDNA insert from pMUCIO was 
used to probe the inserts and found to hybridize to all 6 inserts (Fig.11). pMUC 7 was found to contain two inserts 
40 following digestion with EcoRI; however, only 1 of the inserts hybridized to the pMUCI 0 probe. The insert bands were 

not derived from phage DNA since the pMUCIO probe did not hybridize to Hind Ill-digested X phage DNA 

As shown by agarose gel electrophoresis (Fig. 11), the inserts vary in size from about 200 to up to about 1800 bp. 
The largest insert from pMUCIO has been used as the hybridization probe in all subsequent experiments. 

Because the XMUC clones were identified only by antibody binding, we needed additional assurance that they 
45 were indeed coding for the breast epithelial mucin. To determine the authenticity of pMUCIO, we correlated the pres- 
ence of mRNA hybridizing to the clone with mucin expression in various cell lines. As shown in figure 12, the cDNA 
hybridized to two transcripts of 4.7 kb and 6.4 kb in the RNA from the breast cancer cell lines MCF-7 and T47D which 
were shown previously to express the HMFG-2 antigen (1,14). Significantly, the pMUCIO probe hybridized to transcripts 
of approximately the same size in RNA extracted from normal mammary epithelial cells cultured from milk (42). A third 
so band of 5.7kb can be seen in the RNA from these normal cells. In contrast, three human cell types that lack the mucin, 

breast fibroblasts, Daudi cells and HS578T, a carcinosarcoma line derived from breast tissue (43), showed no detect- 
able pMUCI 0-related mRNA. The 6.4 kb band appears to be the most adundantly expressed. The presence of at least 
two sizes of mRNA from MCF-7 cells correlates with the immu noprecipitation of two proteins of (molecular weights 68 
kd and 92 kd) from in vitro translated mRNA from MCF-7 cells. The normal mammary epithelial cells were derived from 
55 pooled milk samples and the additional transcript observed may be due to polymorphisms among individuals. 
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Genomic DNA blot hybridization and detection of a restriction fragment lenc 



i/morphism (RFLP) 



Genomic DNA was prepared from a panel of ten individuals consisting of six unrelated individuals and a family of 
four, and from three cell lines. The DNAs which were digested with Hinfl or EcoRI and blotted and hybridized to the 
5 radiolabelled pMUCI 0 insert, exhibit restriction fragment length polymorphisms. The restriction fragments from the ten 
individuals and three cell lines are shown in figure 1 3. The pattern consists of either a single band or a doublet of sizes 
ranging from 3400bp to 6200bp in the Hinfl digest (with the exception of the ZR75-1 DNA in lane 1 2, figure 1 3A which 
shows three bands) or from 8200bp to 9600bp in the EcoRI digest (Figure 13B), there appears to be a continuous 
distribution of the fragment sizes which implies a high in vivo instability at the locus. The pattern of fragments observed 
10 in the family of four (lanes 1 -4) suggests that these fragments are allelic. Preliminary studies of the DNA made from 
white blood cells of normal, related individuals indicate the existance of a number of independent alleles with an au- 
tosomal codominant mode of inheritance. These studies will be the subject of a separate investigation. 



Discussion 

The cDNA clones described here which were obtained from the MCF-7 Xgtl 1 library were selected using polyclonal 
and monoclonal antibodies prepared against a normal cellular product, the milk mucin in its deglycosylated form. This 
was done because it was easier to obtain large quantities of the mucin for stripping than to prepare similar quantities 
of immunologically related glycoproteins expressed by breast cancer cells (44). The fact that the antibodies did select 
for cDNA coding for nonglycosylated core protein molecules in MCF-7 cells, strongly suggests that the glycoproteins 
in these cells, which were originally detected by their reaction with antibodies to the milk mucin, contain the same core 
protein as this mucin. This is confirmed by the detection of mRNAs of approximately the same sizes in the normal and 
malignant cells, using one of the probes isolated from the MCF-7 library. We will therefore refer to the antibody reactive 
glycoproteins on breast cancer cells as mucins, bearing in mind that their processing may be different resulting in 
molecules of different molecular weights but with the same core protein as that of the milk mucin. 

Seven clones were obtained from the MCF-7 library of which the largest was ISOOkb. This clone cross hybridized 
with the other 6 smaller clones. The p-galactosidase fusion proteins expressed by six of the cross-hybridizing lambda 
clones were reactive with the polyclonal antiserum directed against the mucin core protein as well as with four well- 
characterized monoclonal antibodies directed to various epitopes on the stripped core protein, SM-3, SM-4, HMFG-1 
and HMFG-2 (14,20). The smallest lambda clone, XMUC9, produced a p-galactosidase fusion protein which reacted 
with three of the four monoclonal antibodies and with the polyclonal antiserum. 

The surprising result that the extensively characterized HMFG-1 and HMFG-2 monoclonal antibodies reacted 
strongly with the lambda plaques and the fusion proteins and could immunoprecipitate proteins from in vitro translated 
mRNA provides strong evidence that these clones do indeed code for a portion of the mucin core protein. Although 
previous evidence such as resistance to fixation, boiling, treatment with dithiothreitol and NaDodS0 4 and the presence 
of multiple epitopes on the molecule suggested that these were carbohydrate (1), it has now been established that the 
epitopes of the HMFG-1 and HMFG-2 monoclonal antibodies are definitely protein in nature. Carbohydrate may be 
required to obtain the strongest binding, either as part of the epitope or by conferring some conformational change on 
the protein portion, but part of the antigenic determinant must consist of an amino acid sequence. Since these two 
MAbs are reactive with the fully glycosylated milk mucin as well as the stripped core protein, this data means that the 
intact molecule contains areas of naked peptide which contribute to the antigenic sites for these two antibodies. 

Confirmatory evidence that pMUCI 0 codes for the mammary mucin core protein is provided by RNA blots. The 
relative abundance of mRNA in the breast cancer cell lines MCF-7, T47D, ZR-75-1 and in normal mammary epithelial 
cells corresponds to the antigen expression by these cells as measured by the binding of the HMFG-1 and HMFG-2 
monoclonal antibodies. Cell types which are negative for antigen expression such as human fibroblasts, Daudi cells 
and HS578T, a carcinosarcoma line derived from breast (14), are negative in RNA blot hybridizations. A fortuituous 
observation made with the ZR-75-1 cells yielded indirect strong evidence that pMUCI 0 does indeed code for the mucin 
glycoprotein core protein. This cell line, which routinely expresses large amounts both of mRNA and antigen, yielded 
one preparation of RNA which was unexpectedly negative by blot hybridization. It was subsequently found that those 
particular ZR-75-1 cells from which the RNA had been made had lost the expression of the antigen as well at this time 
(as determined by reaction with HMFG-1 and 2). Different passage numbers of the ZR-75-1 cells were recovered and 
shown once again to express both antigen and message. The sizes of the messages, 4.7 kb and 6.4 kb, are quite 
large, since a 68 kd or 92 Kd protein would need only about 3 kb to code for the protein portions. This suggests that 
a large portion of the mRNA maybe untranslated. Efforts are underway to obtain a full-length clone. 

Thus, the cDNA clones presented here represent a portion of the gene coding for the human mammary mucin 
which is expressed by differentiated breast tissue as well as by most breast cancers. The major proteins precipitated 
from in vitro translation products of RNA from MCF-7 cells by antibodies to the milk mucin core protein (68Kd) have 
an apparent molecular weight of 68Kd and 92Kd. These proteins, produced by the breast cancer cell therefore share 
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epitopes with the 68Kd core protein of the milk mucin (20). Whether a similar 92Kd protein is also produced by normal 
mammary epithelial cells, and is truncated or destroyed by HP treatment is not yet clear MCF-7 cells biosynthetically 
labelled with 14C amino acids yield upon immunoprecipitation with HMFG-1 and HMFG-2 antibodies, two glycosylated 
proteins of 320 kd and 430 kd and it is possible that each of these glycoproteins utilizes only one core protein of either 
5 68Kd or 92Kd. Alternatively, each of the glycoproteins could contain both the 92Kd and 68Kd proteins either in different 
proportions or variably glycosylated . Further screening of the library may yield full length cDNAs coding for both sizes 
of the immunological ly related core proteins. Since there appears to be only a single gene (based on Southern blot 
data obtained by using a partial cDN A probe), it is probable that the multiple messages arise by alternative RNA splicing 
and this would explain the fact that they contain common sequences. Although a core protein of 68 kd appears to be 
Jo small to yield a fully glycosylated molecule of greater than 300 kd which contains 50% carbohydrate, there is evidence 
that such a structure for mucins is possible. Ovine submaxillary mucin has a reported molecular weight of 1 x 10 6 
daltons (45), yet its protein core consists of 650 amino acids resulting in a molecule of 58 kd (46). 

The mucins which are detected with HMFG-t and HMFG-2 MAbs on immunoblots of tumours and breast cancer 
cell lines show variations in size from 80 kd to 400 kd in the molecular weights of the tumour mucin molecules (1 ,47). 
15 Using these same antibodies which detect high molecular weight mucins present in normal urine, a polymorphism has 
indeed been shown to be genetically determined (48). Although the very low molecular weight components are likely 
to represent precursor forms of the mucin which appears to be incompletely processed in many tumour cells (20), the 
variations in the higher molecular weight components are likely to be due to this genetic polymorphism. It was unclear, 
however, whether the structural basis of the polymorphism was due to the genetically determined protein or to the 
20 carbohydrate portion of the mucin. The detection of restriction fragment length polymorphisms in the Southern blotting 
experiments using the mucin probe suggest that the mucin polymorphism occurs at the level of the DNA which codes 
for the protein. Preliminary sequence data suggest that the basis for this polymorphism is a region of variable tandem 
repeats present in the protein coding sequences. This structural feature may be responsible for the generation of the 
many allelic restriction fragments at the mucin locus. We are presently investigating the basis of the mucin polymor- 
25 phism by a Southern blot curvey of DNA from white blood cells of normal, related individuals whose inheritance pattern 
of urinary mucins has been determined. In addition, we are examining DNA preparations made from the white blood 
cells and tumours of individual breast cancer patients to determine if there is any discordance between genotype in 
the paired samples, since tandemly repeated DNA may provide an unstable site where recombination or amplification 
could occur. 

30 The presence of mucins in the majority of carcinomas and their association with the differentiation of mammary 

epithelial cells makes it particularly important to identify regions involved in the tissue specific and developmental 
regulation of the gene. Moreover, the introduction of a functional mucin gene into cells should provide insights into the 
role of this molecule in breast epithelial differentiation and possibly enable us to identify any alterations in the function 
or expression of the mucin which are related to malignant transformation in the human breast. 
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The abbreviations are as follows: PBS, phosphate-buffered saline; MAb, monoclonal antibody; IPTG, isopropyl p- 
D-thiogalactoside; bp, base pair(s); Kb, kilobase(s). 

TABLE t 



Amino acid composition of the human milk mucin - comparison with PAS-O 



Amino acid 


HMFG-1 purified milk mucin 


Extensively stripped milk mucin 


PAS-O (Shimizu & Yamauchi 
1982) 


Asp 


6.1 


7.2 


6.4 


Thr 


9.4 


9.7 


9.8 


Ser 


9.1 


13.0 


13.1 


Glx 


6.3 


9.6 


8.3 


Pro 


14.8 


14.4 


12.0 


Gly 


8.1 


10.1 


12.2 


Ala 


12.3 


11.9 


13.0 


Cys 


Not analysed 


Not analysed 


0.5 


Val 


6.0 


6.3 


5.3 


Met 


0.5 


0.4 


0.8 


lie 


1.6 


1.7 


1.9 
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TABLE 1 (continued) 





Amino acid composition of the human milk mucin - comparison with PAS-0 


Amino acid 


HMFG-1 purified milk mucin 


Extensively stripped milk mucin 


PAS-O (Shimizu & Yamauchi 
1982) 


Leu 


4.5 


4.8 


3.7 


Tyr 


2.0 


0.9 


1.6 


Phe 


2.0 


.... 16 


1:7 


His 


3.2 


2.3 


3.8 


Lys 


2.8 


3.3 


2.2 


Arg 


4.0 


4.0 


3.9 



Table 2 



Reactivity of the antibodies on intact, partially and totally deglycosylated milk mucin 




125 l cpm bound 


Antibody 


Intact molecule 


Partially stripped mucin 


Totally stripped mucin 


5.17 


8,524 


11,925 


5,780 


9.13 


525 


3,000 


3,328 


SM-3 


465 


15,414 


9,200 


SM-4 


816 


16,750 


9,561 


HMFG-1 


32,000 


33,768 


9,494 


HMFG-2 


29,500 ■ 


29,230 


15,832 


NS2 medium 


397 


845 


650 



The binding of the antibodies to iodinated intact, partially and totally deglycosylated milk mucin was assayed using 
the protein A plate method as described in Materials and Methods. 

References 



1. Burchell, J.M., Durbin, H. and Taylor-Papadimitriou, J. (1983) J. Immunol. 131, 508-513. 

2. Bramwell, M.E., Ehavanandan, V.P., Wiseman, G. and Harris, H. (1983) Br. J. Cancer 48, 177-183. 

3. Mcllhinney, R.A., Patel, S. and Gore, M.E. (1985) Biochem. J. 227, 155-162. 

4. Hilkens, J., Buijs, F., Hilgers, J., Hagemann, Ph., Calafat, J., Sonnenberg, A. and Van der Valk, M. (1984) Int. 
J. Cancer 34, 197-206. 

5. Tagliabue, E., Porro, G., Barbanti, R, Della-Torre, G., Menard, S., Rilke , R, Cerasoli , S. and Colnaghi, M. (1986) 
Hybridoma 5, 107-115. 

6. Johnson, V.G., Schlom, J., Paterson, A.J., Bennett, J., Magnani, J.L. and Colcher, D. (1986) Cancer Res. 46, 
850-857. 

7. Sekine, H., Ohno, T. and Kufe, D.W. (1985) J. Immunol. 135, 3610-3615. 

8. Ormerod, M.G., Steele, K., Edwards, P.A.W. and Taylor-Papadimitriou, J. (1984) J. Expt. Path. 1, 263-271. 

9. Wilkinson, M.J., Howell, A., Harris, J., Taylor-Papadimitriou, J., Swindell, R. and Sellwood, R.A. (1984) Int. J. 
Cancer 33, 299-304. 

10. Kufe, D., Inghirami, G., Abe, M. ( Hayes, D., Justi-Wheeler, H. and Schlom, J. (1984) Hybridoma 3, 223-232. 

11 . Price, M.R., Edwards, S., Robins, R.A., Hilgens, J., Hilkens, J. and Baldwin, R. (1986) Eur. J. Can. Clin. Oncol. 
22, 115-117. 

12. Johnston, W.W., Szpak, C.A., Lottick, S.C., Thor, A. and Schlom, J. (1985) Cancer Res. 45, 1894-1900. 

13. Rasmussen, B.B., Pedersen, B.V., Thorpe, S.M., Hilkens, J., Hilgers, J. and Rose, C. (1986) Cancer Res. 45, 
1424-1427. 

14. Taylor-Papadimitriou, J., Peterson, J.A., Arklie, J., Burchell, J., Ceriani, R.L. and Bodmer, W.F. (1981) Int. J. 
Cancer 28, 17-21. 

15. Burchell, J., Wang, D. and Taylor-Papadimitriou, J. (1984) Int. J. Cancer 34, 763-768. 

16. Shimizu, M. and Yamaauchi, K. (1982) J. Biochem 91, 515-519. 

17. Ormerod, M.G., Steele, K., Westwood, J.H. and Mazzini, M.N. (1983) Br. J. Cancer 48, 533-541. 




EP 0 341 252 B1 



18. Taylor-Papadimitriou, J., lane, E.B. and Chang, S.E. (1983) In, Understanding Breast Cancer: Clinical and 
Laboratory Concepts. (Rich, M.A., Hager, J.C. and Furmanski, P. Eds,) Marcel Dekker, Inc. New York and Basel, 
pp. 215-246. 

19. Chang, S.E. and Taylor-Papadimitriou, J. (1983) Cell Diff. 12, 143-154. 

5 20. Example 1 above. 

21. Mort, A. and Lamport, D. (1977) Anal. Biochem. 82, 289-309. 

22. Wray, W., Bouiikas, T. t Wray, K.P. and Hancock, R. (1981) Anal. Biochem. 118, 197-203. 

23. Dubray, G. and Bezard, G. (1982) Anal. Biochem, 119, 325-329. 

24. Shearer, M., Taylor-Papadimitriou, J., Griffin, D. and Balkwill, F. (1984) J. Immunol, 133, 3096-31 01". 

io 25. Durbin, H. and Bodmer, W.F. (1986) J. Immunol. Meth., In Press. 

26. Lane, E.B. (1982) J. Cell Biol. 92, 665-673. 

27. Gooi, H.C., Jones, N.J., Hounsell, E.F., Scudder, R, Hilkens, J., Hilgers, J. and Feizi, T. (1985) Biochem. 
Biophys. Res. Commun. 131, 543-550. 

28. Chirgwin, J.M., Przybyla, A.E., MacDonald, R.J. and Rutter, W.J. (1979) Biochem. 18, 5194-5199. 

75 29. Walter, P., Green, S., Greene, G., Krust, A., Bornert, J.-M., Heltsch, J.-M., Staub, A., Jensen, E., Scrace, G., 

Waterfield, M. and Chambon, P. (1985) Proc. Natl. Acad. Sci. USA. 82, 7889-7893. 

30. Huynh, T.V., Young, R.A. and Davis, R.W. (1985) In, DNA Cloning: A Practical Approach, ed. Glover, D.M. 
(IRL, Oxford), Vol. 1, pp. 98-121. 

31. Young, R.A. and Davis, R.W. (1983) Proc. Natl. Acad. Sci. USA 80, 11 94-1198. 

20 32. Young, R.A. and Davis, R.W. (1983) Science 222, 778-782. 

33. Vieira, J. and Messing, J. (1982) Gene 19, 259-268. 

34. Young, R.A. and Davis, R.W. (1985) In Genetic Engineering, eds. Setlow, J.K. and Hollaender, A. (Plenum, 
New York), Vol. 7. 

35. Laemmli, U.K. (1970) Nature (Lond) 227, 680-685. 

25 36. Towbin, H., Staehelin, Y. and Gordon, J. (1979) Proc. Natl. Acad. Sci. USA. 76, 4350-4354. 

37. Maniatis, T., Fritsch, E.F. and Sambrook, J. (1 982) Molecular Cloning: A Laboratory Manual (Coldspring Harbor 
Laboratory, Cold Spring Harbor, NY)! 

38. Thomas, PA (1980) Proc. Natl. Acad. Sci. USA. 77, 5201-5205. 

39. Woodhead, J.L., Fallon, R., Figueiredo, H., Langdale, J. and Malcolm, A.D.B. (1986) In Human Genetic Dis- 

oo eases, ed. Davies, K.E. (IRL, Oxford), p. 56. 

40. Old, J.M. (1985) In Human Genetic Diseases, Ed. Davies, K.E., (IRL, Oxford), p.4. 

41. Feinberg, A.P. and Vogelstein, B. (1984) Anal. Biochem. 137, 265-267. 

42. Taylor-Papadimitriou, J., Purkis, P. and Fentiman, I.S. (1980) J. Cell. Physiol. 102, 317-321. 

43. Hackett, A.J., Smith, H.S., Springer, E.L., Owens, R.B., Nelson-Rees, W.A., Riggs, J.L. and Gardner, M.B ; 

35 (1977) J. Natl. Cancer Inst. 58, 1795-1800. 

44. Griffiths, A.B., Burchell, J., Taylor-Papadimitriou, J., Gendler, S.J., Lewis, A. and Tilly, R. (1987) Int. J. Cancer 
(submitted). 

45. Gottschalk, A., Bhargava, S. and Mury, V. In Gottschalk, A. (ed) Glycoproteins their composition, structure and 
function, pp, 810-829, Elsevier, New York. 

40 46. Hill, M.D.Jr., Reynolds, J.A. and Hill, R. (1977) J. Biol. Chem. 252, 3791-3798. 

47. Taylor-Papadimitriou, J., Millis, R., Burchell, J. t Nash, R., Pang, L. and Gilbert, J. (1986) J. Expt. Path. 2, 
247-260. 

48. Swallow, D., Griffiths, B., Bramwell, M., Wiseman, G. and Burchell, J. (1986) Disease Markers 3, In Press. 

49. Abe, M. and Kufe, D. Effects of maturational agents on expression and secretion of two partially characterized 

45 high molecular weight milk-related glycoproteins in MCF-7 breast carcinoma cells. J. Cell. Physiol. 126:126-136, 

. 1986. 

50. Arklie, J., Taylor-Papadimitriu, J., Bodmer, W:F., Egan, M. and Millis, R. Differentiation antigens expressed by 
epithelial cells in the lactating breast.are also detectable in breast cancers. Int. J. Cancer 28:23-29, 1981. 

51. Bolton, A.E. and Hunter, W.M. The labelling of proteins to high specific radioactivities by conjugation to a 125 l 

so containing acylating agent. Biochem. J. 133:529-538, 1973. 

52. Karlsson, S., Swallow, D., Griffiths, B., Corney, G. and Hoplinson, P. A genetic polymorphism of a human 
urinary mucin. Ann. Hum. Genet. 47:263-269, 1983. 

53. Kearney, J.F., Radbruch, A., Liesegang, B. and Rajewsky, K. A new mouse myeloma cell line that has lost 
immunoglobulin expression but permits the construction of antibody-secreting hybrid cell lines. J. Immunol. 123: 

55 1548-1550,1979. 

54. Melero, J. and Gonzales-Rodrigues, J. Preparation of monoclonal antibodies against glycoproteins 111a of 
human platelets. Eur. J. Biochem. 141:421-427, 1984. 

55. Clamp, J.R., Allen, A., Gibbons, R.A. and Roberts, G. Chemical aspects of mucins. Br. Med. Bull. 34:25-41, 



17 




EP 0 341 252 B1 



1978. 



Claims 

5 

1. A polypeptide recognisable by the monoclonal antibody SM3 secreted by hybridoma HSM3 (EC ACC 87010701), 
said polypeptide comprising at least 5 consecutive amino acid residues in a sequence encoded by the tandem 
repeat sequence of mRNA corresponding to the human polymorphic epithelial mucin core protein gene, which 
mRNA is obtainable from the human cancer cell line MCF-7, other than the high molecular weight glycoproteins 

to expressed by normal and malignant human epithelial cells, for use in a method of therapy practised on the human 

or animal body. 

2. A polypeptide according to claim 1 comprising at least 10 consecutive residues in a sequence encoded by the said 
tandem repeat. 

75 

3. A polypeptide according to claim 1 or claim 2 linked to a carrier protein. 

4. A polypeptide according to any one of claims 1 to 3 having the full sequence encoded by the tandem repeat 
sequence of the HPEM gene. 

20 

5 . A polypeptide according to claim 3 or claim 4, wherein at least one amino acid residue bears a linkage sugar 
substituent. 

6. A polypeptide according to claim 5, wherein the linkage sugar bears an oligosaccharide moiety. 

25 

7. A polypeptide according to claim 5 or claim 6, wherein each amino acid bearing a substituent is a serine or threonine 
and the linkage sugar is N-acetyl galactosamine. 

8. A polypeptide according to any one of claims 1 to 7 bearing a detectable label or a therapeutically effective moiety. 

30 

9. Use of a polypeptide recognisable by the monoclonal antibody SM3 secreted by hybridoma HSM3 (ECACC 
87010701), said polypeptide comprising at least 5 consecutive amino acid residues in a sequence encoded by the 
tandem repeat sequence of mRNA corresponding to the human polymorphic epithelial mucin core protein gene, 
which mRNA is obtainable from the human cancer cell line MCF-7, other than the high molecular weight glycopro- 

35 teins expressed by normal and malignant human epithelial cells, in the production of a medicament for use as an 

immunogen in a method of cancer therapy practised on the human or animal body. 

10. Use of a polypeptide recognisable by the monoclonal antibody SM3 secreted by hybridoma HSM3 (ECACC 
87010701 ), said polypeptide comprising at least 5 consecutive amino acid residues in a sequence encoded by the 

40 tandem repeat sequence of mRNA corresponding to the human polymorphic epithelial mucin core protein gene, 

which mRNA is obtainable from the human cancer cell line MCF-7, other than the high molecular weight glycopro- 
teins expressed by normal and malignant human epithelial cells, in a method of in vitro diagnosis. 

11. Use according to claim 9 or claim 10 of a polypeptide as defined in any one of claims 2 to 8. 

45 



Patentansprucho 

1. Polypeptid, das durch den monoklonalen Antikorper SM3, der von dem Hybridom HSM3 (ECACC 87010701) 
sekretiert wird, erkannt werden kann, wobei das Polypeptid wenigstens 5 aufeinanderfolgende Aminosaurereste 
in einer Sequenz umfaBt, die von der Tandem-Wiederholungssequenz der mRNAcodiert wird, die dem humanen 
polymorphen epithelialen Mucin-Core-Proteingen entspricht, wobei die mRNA von der humanen Krebszellinie 
MCF-7 erhalten werden kann, wobei das Polypeptid anders ist als die Glykoprotelne mit hohem Molekulargewicht, 
die von normalen und malignen humanen epithelialen Zellen exprimiert werden, zur Verwendung in einem The- 
rapieverfahren, das am menschlichen Oder tierischen Korper angewandt wird. 

2. Polypeptid nach Anspruch 1, umfassend wenigstens 10 aufeinanderfolgende Reste in einer Sequenz, die von der 
Tandem-Wiederholung codiert wird. 
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3. Polypeptid nach Anspruch 1 Oder Anspruch 2, das an ein Tragerprotein gebunden ist. 

4. Polypeptid nach einem der Anspruche 1 bis 3, das die vollstandige Sequenz, die von der Tandem-Wiederholungs- 
sequenz des HPEM-Gens codiert wird, besitzt. 

5 

5. Polypeptid nach Anspruch 3 Oder Anspruch 4, dadurch gekennzelchnet, daB wenigstens ein Aminosaurerest 
einen Bindungszuckersubstituenten tragt. 

6. Polypeptid nach Anspruch 5, dadurch gekennzelchnet, daB der Bindungszucker einen Oligosaccharidtell tragt. 

10 

7. Polypeptid nach Anspruch 5 Oder Anspruch 6, dadurch gekennzelchnet, daB jede Aminosaure, die einen Sub- 
stituenten tragt, ein Serin Oder Threonin ist und daB der Bindungszucker N-Acetylgalactosamin ist. 

8. Polypeptid nach einem der Anspruche t bis 7, welches eine nachweisbare Markierung Oder einen therapeutisch 

is wirksamen Teil tragt. 

9. Verwendung eines Polypeptides, das von dem monokionalen Antikorper SM3, der von dem Hybridom HSM3 
(ECACC 87010701 ) sekretiert wird, erkennbar ist, wobei das Polypeptid wenigstens 5 aufeinanderfolgende Ami- 
nosaurereste in einer Sequenz umfaBt, die von der Tandem-Wiederholungssequenz der mRNA codiert wird, die 

20 dem humanen polymorphen epithelialen Mucin-Core-Proteingen entspricht, wobei die mRNA von der humanen 

Krebszellinie MCF-7 erhalten werden kann, wobei das Polypeptid anders ist als die Glykoproteine mit hohem 
Molekulargewicht, die von normalen und malignen humanen epithelialen Zellen exprimiert werden, zur Herstellung 
eines Medikamentes zur Verwendung als ein Immunogen in einem Verfahren der Krebstherapie, welches am 
menschlichen Oder tierischen Korper durchgefuhrt wird. 

25 

10. Verwendung eines Polypeptides, das von dem monokionalen Antikorper SM3, der von dem Hybridom HSM3 
(ECACC 87010701) sekretiert wird, erkennbar ist, wobei das Polypeptid. wenigstens 5 aufeinanderfolgende Ami- 
nosaurereste in einer Sequenz umfaBt, die von der Tandem-Wiederholungssequenz der mRNA codiert wird, die 
dem humanen polymorphen epithelialen Mucin-Core-Proteingen entspricht, wobei die mRNA von der humanen 

30 Krebszellinie MCF-7 erhalten werden kann, wobei das Polypeptid anders ist als die Glykoproteine mit hohem 

Molekulargewicht, die von normalen und malignen humanen epithelialen Zellen exprimiert werden, in einem Ver- 
fahren zur ]n_vitro- Diagnose. 

11. Verwendung eines wie in einem der Anspruche 2 bis 8 definierten Polypeptids nach Anspruch 9 oder Anspruch 10. 

35 

Revendlcations 

Un polypeptide reconnaissable par I'anticorps monoclonal SM3 secrete par Phybridome HSM3 (ECACC 
87010701), ledit polypeptide comprenant au moins 5 residus d'acides amines consecutifs dans une sequence 
codee par la sequence repetee en tandem de PARNrn correspondant au gene de la proteins de cors de la mucine 
epithelial© polymorphe humaine, lequel ARNm peut etre obtenu k partir de la lignee de cellules cancereuses 
humaines MCF-7, autre que les glycoproteines de haut poids molbculaire exprim6es par des cellules bpitheliales 
humaines normales et malignes, pour utilisation dans une methods therapeutique pratiquee sur le corps humain 
ou animal. 

Un polypeptide selon la revendication 1, comprenant au moins 10 residus consecutifs dans une sequence codes 
par ladite sequence rbp6t6e en tandem. 

Un polypeptide selon la revendication 1 ou la revendication 2, 116 k une prot6ine support. 

Un polypeptide selon Tune quelconque des revendications 1 k 3, ayant la sequence complete codee par la se- 
quence r6p6t6e en tandem du gene HPEM. 

Un polypeptide selon la revendication 3 ou la revendication 4, dans lequel au moins un residu d'acide amine porte 
un substituant sucre de liaison. 

Un polypeptide selon la revendication 5, dans lequel le sucre de liaison porte un groups oligosaccharidique. 



1 . 

40 

45 

2 . 

so 3. 

4. 

55 5 . 

6 . 



19 




EP 0 341 252 B1 



7. Un polypeptide selon la revendication 5 on la revendication 6, dans lequel chaque acide amin6 portant un subs- 
trtuant est une serine ou une threonine et le sucre de liaison est la N-ac6tyl galactosamine. 

8. Un polypeptide selon Tune quelconque des revendications 1 & 7, portant un marqueur detectable ou un groupe & 

5 effet th^rapeutique. 

9. Utilisation d'un polypeptide reconnaissable par Panticorps monoclonal SM3 secrete par I'hybridome HSM3 (ECACC 
87010701), ledit polypeptide comprenant au moins 5 residus decides amines cons^cutifs dans une sequence 
codee par la sequence repetee en tandem de I'ARNm correspondarit au gene de la proteine de cors de la mucine 

io epithelial© polymorphe humaine, lequel ARNm peut etre obtenu & partir de la Iign6e de cellules cancereuses 

humaines MCF-7, autre que les glycoproteines de haut poids mol^culaire exprimSes par des cellules epitheliales 
humaines normales et malignes, dans la production d'un medicament pour utilisation comme immunogene dans 
une methods de therapie du cancer pratiquee sur le corps humain ou animal. 

15 10. Utilisation d'un polypeptide reconnaissable par I'anticorps monoclonal SM3 secrete par I'hybridome HSM3 (ECACC 

87010701), ledit polypeptide comprenant au moins 5 residus d'acides amines consecutifs dans une sequence 
codee par la sequence repetee en tandem de I'ARNm correspondant au gene de la proteine de cors de la mucine 
epithelial© polymorphe humaine, lequel ARNm peut etre obtenu a partir de la lignee de cellules cancereuses 
humaines MCF-7, autre que les glycoprot6ines de haut poids moteculaire exprim^es par des cellules 6pith6lia!es 

20 humaines normales et malignes, dans une methode de diagnostic in vitro. 

11. Utilisation selon la revendication 9 ou la revendication 10 d'un polypeptide tel que defini dans I'une quelconque 
des revendications 2 & 8. 

25 
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Fig.6.(Cont.d) 
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Fig. 6. (Corf, d) 
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Fig.6.(Cont.d) 
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Fig. 10. 
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Fig.13A. Fig. 13B. 
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